首页> 外文OA文献 >MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification
【2h】

MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

机译:mEBoost:混合估算器与不均衡数据的提升   分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

Class imbalance problem has been a challenging research problem in the fieldsof machine learning and data mining as most real life datasets are imbalanced.Several existing machine learning algorithms try to maximize the accuracyclassification by correctly identifying majority class samples while ignoringthe minority class. However, the concept of the minority class instancesusually represents a higher interest than the majority class. Recently, severalcost sensitive methods, ensemble models and sampling techniques have been usedin literature in order to classify imbalance datasets. In this paper, wepropose MEBoost, a new boosting algorithm for imbalanced datasets. MEBoostmixes two different weak learners with boosting to improve the performance onimbalanced datasets. MEBoost is an alternative to the existing techniques suchas SMOTEBoost, RUSBoost, Adaboost, etc. The performance of MEBoost has beenevaluated on 12 benchmark imbalanced datasets with state of the art ensemblemethods like SMOTEBoost, RUSBoost, Easy Ensemble, EUSBoost, DataBoost.Experimental results show significant improvement over the other methods and itcan be concluded that MEBoost is an effective and promising algorithm to dealwith imbalance datasets. The python version of the code is available here:https://github.com/farshidrayhanuiu/
机译:由于大多数现实生活中的数据集不平衡,类不平衡问题一直是机器学习和数据挖掘领域中一个具有挑战性的研究问题。现有的几种机器学习算法都试图通过正确识别多数类样本而忽略少数类来最大化准确性分类。但是,少数派实例的概念通常比多数派代表更高的兴趣。最近,在文献中已经使用了几种成本敏感的方法,集成模型和采样技术,以对不平衡数据集进行分类。在本文中,我们提出了MEBoost,一种用于不平衡数据集的新提升算法。 MEBoost混合了两个不同的弱学习者,以提高其在不平衡数据集上的性能。 MEBoost是现有技术(例如SMOTEBoost,RUSBoost,Adaboost等)的替代方法。MEBoost的性能已在12种基准不平衡数据集上进行了评估,这些数据集具有SMOTEBoost,RUSBoost,Easy Ensemble,EUSBoost,DataBoost等最新集成方法。通过对其他方法的改进,可以得出结论,MEBoost是处理不平衡数据集的有效且有前途的算法。该代码的python版本在此处提供:https://github.com/farshidrayhanuiu/

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号